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Abstract 



£^ ■ Contact matrices provide a coarse grained description of the con- 

(y-j ■ figuration w of a linear chain (polymer or random walk) on Z n : 

CO ! Cij(oj) = 1 when the distance between the position of the i-th and 

j-th step are less than or equal to some distance a and Cij(uj) = 
otherwise. We consider models in which polymers of length N have 
CN , weights corresponding to simple and self-avoiding random walks, SRW 

and SAW, with a the minimal permissible distance. We prove that to 
leading order in N, the number of matrices equals the number of walks 
for SRW, but not for SAW. The coarse grained Shannon entropies for 
| SRW agree with the fine grained ones for n < 2, but differs for n > 3. 

a' 



1 Introduction 



The use of coarse grained descriptions is essential for systems with many 
degrees of freedom. The choice of the coarse grained variables is dictated 
by the nature of the system and by the questions of interest. One is then 
interested in the amount of information lost in the coarse graining, at least 
in some statistical sense 0. 

In this paper we shall study this question for simple models of polymers, 
large molecules consisting of a linear sequence of N monomer units. A re- 
duced description of this system can be based [3 HU [HJ HH] on associating 
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Random walk lo\ 



Random walk 002 



Figure 1: The two random walks U\ and 002 have the same contact matrix 
because they have the same self-intersections: 2 — 10 and 13 — 17. 

to each polymer configuration a connectivity or contact matrix C, such that 
Cij = 1 or depending on whether the distance between the position of the 
i-th and j-th monomer, and is smaller or bigger than a certain 

specified value a, 



This coarse grained (see Figure Q]) representation of the structure of proteins 
is often used for numerical studies of protein folding. The very minimal- 
ist nature of this representation permits a rapid first search for a protein's 
native structure in terms of its contact matrix. In fact, knowledge of the 
contact matrix can predict many features of the vibrational spectra of cer- 
tain proteins |3] . This makes it important to have information about the 
relation between the space of contact matrices and that of the proteins they 
represent. 

To answer the question of how much information about a polymer is 
retained by its contact matrix, we consider an idealized version of the geo- 
metrical structure of a polymer in which the monomers occupy sites on the 
n-dimensional lattice Z n , and consecutive monomers are on nearest neighbor 
lattice sites. 

We compare the Shannon entropy after coarse graining, Sc{N), with 
the Shannon entropy without coarse graining S(N). To quantify the loss 
of information in the coarse graining we consider 5^ = Sc/S. We prove, 
for SRW on Z 2 , that 5^ goes to one as N becomes large, showing that the 
relative loss of information (5* — Sc) / S vanishes. This is a consequence of 
the recurrence of SRW on Z 2 . Moreover we provide some bounds on finite 




1 if -u(j)\ < a, i^j, 
otherwise. 
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size corrections (see flOEJ)). On the other hand, for SRW in Z n , n > 3, and 
for SAW in Z n , n > 2, 5n remains strictly less than one, which means that 
the loss of information due to the coarse graining becomes substantial (see 
(j3~2bj) and Theorem EPj) . 

We also consider the problem addressed already in [Hj, i.e. how the num- 
ber of different physical contact matrices W(N) depends on the polymer 
length N. It was shown there analytically that W(N) increases exponen- 
tially in N, and numerically that the growth exponent 7 at is strictly less 
than the growth exponent for the total number of SAW. In our work we 
give a rigorous proof of this (see Theorem 13.3)1 . We also consider the same 
problem for SRW. Surprisingly, the growth exponent for the contact matrices 
is now the same as for the number of SRW in all dimensions. The reason 
for this is that the probability distribution of the number of distinct visited 
sites (divided by N) has a left tail which does not decay exponentially fast 
in N. To conclude, we provide lower bounds on 7^, relevant to the finite size 
behaviour (see (j3.1a|) and (|3.2a|) ). 

The outline of the rest of the paper is as follows. In sectionEJwe introduce 
the model, the relevant quantities and the studied examples. The main 
results are presented and briefly discussed in section El Sections IH El and El 
are devoted to the proof of the main results. 

2 Preliminaries 

We define more precisely the quantities and the examples which will be stud- 
ied, fljv is the set of all polymers containing N + 1 monomers, where the 
configuration of such a polymer is specified by un = (u(0),uj(1), ...,u(N)) 
with co>(0) = and u(i + 1) — u(i) = ±e a , where e a is one of the unit direc- 
tions on Z n , a = l,...,n. Let there be given some probability distribution 
¥(u) on Q. (We shall drop the subscript N whenever possible.) The contact 
matrices C = {C(u;)} we o partition Q into sets Qc = : C(u) = C}, with 



the number of configurations 00 G fie- The probability of ui being in ttc is 
then 



To measure the information lost in the coarse-graining we may compare the 
Shannon entropy Sc of the coarse grained measure P(C) with the fine grained 



degC= \n c 



(2.1) 




(2.2) 
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entropy S, 

S = -^P(w)lnP(w). (2.3) 

wen 

We then have 

sc = -Yl inP(c) = -E E p ( w ) ln (w§ p ^ 

= S-S c (2.4) 



where 



with 



S c = - E P ( C ) E P M C ) 1bP(w|C) (2.5) 

cec wen c 



¥(u\C) = P(cj)/P(C) for lu G fie (2.6) 

is the conditional probability of uo given that it is in Qq- We can thus think 
of Sc as the average "conditional entropy" relative to C. Since Sc > we 
clearly have Sc < S, and S — Sc is then a measure of information lost in 
the coarse graining 2|. The question is how much. In particular we may ask 
how does 

Sn = S c /S (2.7) 

behave as iV — > oo. 

Before answering this question we note that 

Sc<S c = \n\C\, (2.8) 

where Sc is the entropy of the distribution P(C) which assigns equal weight 
to all C E C, i.e. P(C) = W(N)~\ with W(JV) = |C| the total number of 
different coarse grained components, i.e. contact matrices. We may then also 
define 

7tv = Sc/S (2.9) 

and by S 

5iv < In < 1. (2.10) 

The last inequality is obtained by replacing P(C) by P(C) in (|2.4|) - (|2.6|l and 
using that the corresponding Sc is positive. 

So far everything is completely general. We shall now specialize to the 
case where all permissible configurations u, i.e. all those for which ^ 0, 
have the same probability. Then 

W(N) = |C| =EVdegC(w) = |fi|E(degC)-\ (2.11) 
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where E is the expectation value with respect to the relevant uniform distri- 
bution. 

The examples we shall consider here are: 

1) The weights are those of simple symmetric random walks (SRW) on 
Z n , i.e. |0| = {2n) N and P(w) = (2n)~ N for all u. 

2) The polymers behave like self avoiding walks (SAW) on Z n , i.e. the 
configuration space ft consists of all uj s.t. uj{i) ^ u(j) for i ^ j, and F(u) = 

where \fl\ ~ $ AW is the number of SAW on Z n of length N. SAW 
model the steric exclusion effects of the monomers and are frequently used 
as a model for polymers |3J Ej. 

3) The chains behave like bond self avoiding walks (BAW) on Z n in which 
case fl consists of all ui such that the pair [u(i),u(i + 1)] ^ [u(j),uj(j ± 1)] 
for i ^ j, and P(w) = ~ /4aw; the number of BAW 

Note that for uniform distributions S is just the logarithm of the to- 
tal number of configurations, i.e. S = ln|0|, so 7^ is just the ratio of the 
logarithms of the numbers of contact matrices and random walks. 

The behavior of 7^ was studied in [TJ] for the case of SAW, with 

CH = (i ifKi)- w (j)| = i,|*-j|>i, {2A2) 

n ' \ otherwise. v ' 

(The inequality can in fact be made an equality here since 1 is the minimal 
distance between u){t) and ui(j) for i 7^ j.) Numerical studies pi] for n = 2 
indicated that 7^ remains strictly less than 1 in the limit iV — > 00. It is then 
natural to ask whether the same is true for the SRW when we again define 
Cij(u) = 1 when uj{i) and u(j) are as close as they can be, i.e. when a in 
is set equal to zero 

O) = { I fo r l41-^)ho,^j (213) 

n 1 \ otherwise v 1 

Note that for this case W(N) satisfies 

W{N X + N 2 ) > W(N 1 )W(N 2 ). (2.14) 

Since for SRW 7^ = ^^ 2 ^| it follows from (|2.14J) that 7^ is monotone 
non-decreasing in N and thus that Iini7v^oo77v exists: remember 7^ < 1. 

In the present work we prove some results about 5n and 7^ for all the 
above examples. (Some of these generalize readily to other uniform distribu- 
tions.) 
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3 Main Results 



Theorem 3.1. For SRW on 1? , there exist constants k, Ki, K2 > such that 
for large N, 

ft In iV . 
7jv - 1 iVVs - ' (3 ' la) 

1 " 7^7 < 5 * < 1 " 7T^- ( 3 - lb ) 
In A (IniV) 2 v ' 

Consequently, 7 at and Sn — > 1, as N — > oo. 

Theorem 3.2. For SRW onTU 1 , n >3, there exist constants K n , K f n > such 
that for N large enough, 

^n>1- N2 *" n+2 y (3-2a) 

8 N <1- ««■ (3.2b) 
Hence, —> 1, as N —> oo u>/w/e limsup^^^ 5a? < 1. 

Theorem 3.3. For SAW and BAW, on 7L n , n>2, lim sup^oo 7jv < 1 and 
ipso-facto lim sup^^oo 5n < 1- 

We are indebted to Harry Kesten for the key idea in the proof of Theo- 
rem EH!! for SAW. The extension to BAW is straightforward. 

Intuitively we expect that the larger the number of intersections, the more 
information is contained in the contact matrices. It is therefore not surprising 
that for recurrent RW, such as SRW in n = 2, both 77V and 5^ would go to 
1. (For the degenerate case n — 1 there are, for all the cases, just twice as 
many random walks as contact matrices corresponding to whether the first 
step is to the right or the left.) One expects however that for RW which have 
a strong tendency to spread out such as SAW in n > 2, and SRW in n > 3 
the contact matrices lose too much information. This is indeed reflected in 
Sn < 1 for SAW in n > 2, and SRW in n > 3. The same is true for 7^ for 
SAW. Surprisingly however 7^ —>■ 1 for SRW in all dimensions. The reason 
for this, as we shall see, is that the probability that an SRW of length in 
Z n visits Rn < eA distinct sites goes to zero slower than exponentially when 
N — > 00 for any fixed e > 0. 

The outline of the rest of the paper is as follows. In section 0] we first 
present some general inequalities and then prove the results about 7^ for 
SRW. In section El we give some bounds on the degeneracy of SRW and then 
prove the results about 5n for SRW. In section |H1 we prove Theorem 13. 31 for 
SAW and BAW. 
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4 Proof of Results for 7^ 



Some Inequalities 

Using Jensen's inequality on the average over Qc, gives 

E &rc( ¥ ^ de s c ) ln ( p ^) de § c ) ^ E ¥ ^ ln ( E p h) 

= P(C)lnP(C). (4.1) 



Writing now 



we obtain 1 
which yields 



S = - E P ( w ) (ln(P(o;) deg C) - ln deg C) (4.2) 



5 < S c + E(lndegC), (4.3) 
E(ln degC) 



s , <S N <1. (4.4) 

We next give an upper bound for the degeneracy of the contact matrices 
defined in (12.13)) . This is the key to our results for SRW. Note that for the 
examples considered here, the first inequality in ()4.4j) is indeed an equality. 

Define the range R n of a SRW with length iV to be the number of distinct 
sites visited by the walk. 

Lemma 4.1. Let u have a range Rn = M and letC{ui) be its contact matrix. 
Then 

degCH < (2n) M . (4.5) 

Proof. The contact matrix C(u) has N + 1 — M columns with "1" 's in the 
upper triangular part, because we have N + 1 — M intersections. Let us now 
construct all random walks uj' such that C(u>') = C(u). Consider now u'(k) 
with k 0, then there are two possible cases: 

1. there exists a i < k such that Cik{ui) = 1, 

2. for all i < k, Cik(ou) = 0. 

In the first case, u'(k) = uj'(i) and therefore we have only one choice 
for it. In the second case, the k th step will occupy a place which was never 
occupied before. Therefore we have at most 2n possibles choices. 



1 The right side of (|4.H|> is the maximum of the entropy over all measures, fi((j) such 
that n{C) = P(C). 
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For a uj' with a contact matrix C(uj') = C(u>) there are M — 1 steps for 
which we are in the second case because the starting point is fixed at the 
origin and N + 1 — M steps for which we are in the first one. Therefore 
there are at most (2n) M_1 < (2n) M different uj' satisfying C(u/) = C(uj), i.e. 
degC(u) < {2n) M . □ 

Proof of (I3.1aj) . We first note that 5n,Jn - > 1 for general recurrent RW, 
which includes SRW on Z 2 . For such walks E(i?jv)/iV — > as iV — > oo, 
therefore using (|4.4jl and Lemma f4.1[ 

, E(lndegC) , MR N ) , 
^ > 1 \ r1 „ ; > 1 -> 1 as iV -> oo. 4.6 

To prove (|3.1aj) consider the subset fl a of SRW on Z 2 defined as 

tt a = {uj E tt N s.t. uj(k ■ A[N a ]) = 0, k = 0, 1, . . . , AwJ. (4.7) 

where /c ma a; is the largest integer such that fc4[iV Q ] < N. Let us take 
< a < 1/2. Each w 6 O" returns to the origin after 4[iV a ] steps, it is 
therefore contained in a cube of edge length 4[iV Q! ] (except eventually for the 
last 2[iV a ] steps). Using Stirling formula we have, for some K > 0, 

P ^ 4M ) = °) ^ (WW^ * KM ~ 3/2 (4 ' 8) 

because {uj(AM) = 0} D {w(4M) = with M steps in each direction}. 
Then 

F(R N < = A 2 [N a ] 2 + 2[N a ]) > F(uj e tt a ) 

> (K/[N a ]V 2 ) N/4[Na] - ( 4 -9) 
Therefore combining (|2.11j) . (|4.5jl and ()4.9j) for n = 2 obtain 

W(JV) > 4 w P( J R iV < N P )/A^ P \ (4.10) 

which implies 

In AT A 2 \N a } 2 + 2\N a ] 

for a k > 0. For a = 1/3 the RHS of (|4.11jl is optimized and the term with 
the logarithm dominates the last one. This proves the bound for n = 2. □ 
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Proof of (HOH). For n > 2, F(R N = X) ~ exp(-aiV/X 2 / n ) when X -> oo, 



(see [IH] and pp. 88-92 of [H]). Therefore for a G (0,1), P(i? 



A' 



X 
N 

N a ) ~ exp(-a^ 1 - 2a/n ). But for an with R N = M, degC(cu) < (2n) M , 
see Lemma 14.11 Therefore restricting the sum in (J2.11j) to w G Q%, we have 
W(N) > (2n) 7V P(i?jv = N a ) / (2n) N " , since the numerator is just the number 
of terms in that sum. This implies, for large N, 

1 > 7jv > 1 - v{N, a) (4.12) 

where a{N, a) = N^+aN' 201 ^ / ln2n. Choosing a G (0, 1) which minimizes 
cr(N, a) for large N, we obtain a — 1 = —2/ (n + 2). Taking K n = (ln2n + 
a) I In In completes the proof. □ 

This theorem implies that for N large we have (up to smaller corrections), 

W(N) > (2n) N (2n)-^ N " /{n+2) . (4.13) 

There exists also an upper bound on 77V which depends on the decrease 
of F(R N /N < e). 

Proposition 4.2. For all fixed e > 0, there exists a constant k' > such 
that for N large enough, 

T»<.-^«^. (4.14) 
The outline of the proof will be given in the Appendix. 

5 Bounds on the degeneracy of SRW 
5.1 SRW on I? 

For SRW on Z, 2 , Rn ~ more precisely (see e.g. [THj). 

E(i2*) = ^(l + 0(l/lnJ\O). (5.1) 

Next we apply a result of van Wijland, Caser and Hilhorst Let the 
support of uj be defined to be the set of points visited by uj. Consider two 
finite disjoint sets of lattice points A u and A v . The "pattern" centered at 
x associated with the sets A u and A v is a configuration of \A V \ visited sites 
x + z, z G A v and of \A U \ unvisited sites x + z', z' G A u . We say that the 
pattern appears in the support of uj at x if the lattice points x + z, z G A v , 
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Figure 2: The pattern Q. The visited sites of Q are black. 



are in the support of uj and the lattice points x + z', z' G A u , are not in the 
support of uj. The numbers of times that a pattern appears in the support 
of uj is then the number of different x G Z n such that it appears at x. 

Let us consider the "pattern Q" defined as the set composed of the 
following sets A v and A u : A V (Q) = {(0, 0), (-1, 0)} and A U (Q) = 
{(1, 0), (0, -1), (0, 1), (-1, 1)} (see Figure©. 

Let Qn = Qn(u) be the number of times that Q appears in the support 

of uj. Then using [IE], E(Q N ) = J 2 Wi + O ( ^ N) s ) where m x = m^Q) 

is a constant, and Qat — E(Qjv) — [^yE(QAr)7(A^) where 7(iV) is a random 
variable (Varadhan's renormalized local time of self-intersections, see jlOj ) 
with mean and variance 1. A is a constant given in jTHj whose value is 
~ 1.3034. We computed m 1 finding m x = 2.78 • 10~ 3 . 
These results imply the following proposition. 

Proposition 5.1. For simple random walks on Z 2 , there exists a v > such 
that 



Proof. Suppose that the pattern Q, centered at ( G Z 2 , exists in the sup- 
port of a random walk uj. Let us consider the following transformation: 



In other words we exchange the points ( and £ + (—1, 1). This application 
does not change the contact matrix of the random walk, because £+(—1,1) 
is connected only with ( + (—1,0). We have to prove that the probability 
of having the pattern Q in the support of a random walk at least M = 




(5.2) 



T ( : Q 
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uN/(hiN) 2 times goes to 1 as iV — ► oo. A RW with M times the pattern 
Q appearing in its support is at least 2 M times degenerate: we can apply or 
not apply independently for each ( such that Q appears in the support of 
uj (centered in (). 

We want an upper bound of P (Qn < a j\^y) ^ or 01 e ^' ^ anc ^ ^ = 
mi7r 2 . For each k > and N large enough, 

¥ ( Qn < "pf) - K Qn " e(Qn) - ~ kaQ o^w) (5 - 3) 

with ag = 2fiA. In fact, for N large enough, F,(Q N ) = + 
C(A^/(ln8A^) 3 ) and therefore for each a < 1, + 0(JV/(In 8A^) 3 ) - 

fca Q(kTW - "(iSF- Thus 



uN \ ( N 



" ^ A^ 2 ' (5 - 4) 

ft °Q(ln8iV) 6 

Therefore for each a G (0, 1) we have VA; > 0, 

lim P (q n < a-^—] < (5.5) 
at^oo ^ (IniV) 2 / A; 2 

This implies that for all a G (0, 1), lim^oo P (Qn > a j^§f) = 1. All the 
random walks with a number of pattern Q in their support more than a njijjfp 
are more degenerate than e a v Nln2 /( lnN ) 2 . Then, since a G (0, 1), for all choice 
of v < yuln2 we have (|5.2|) . □ 



5.2 SRW on Z", ra > 3 

Let us define, for n = 3, the pattern P as consisting of a set A v of visited 
sites and a set A u of unvisited sites as follows: A V (P) = {(0, 0, 0), (—1, 0, 0)} 
and A U {P) = {(1, 0, 0), (0, 1, 0), (0, -1, 0), (0, 0, 1), (0, 0, -1), (-1, 1, 0)}. 

Let P/v = Pn(&) be the number of times that P appears in the support 
of u Then using [II] we have E(P N ) = mxN + 0(y/N) and P N - E(P N ) ~ 
a pV N In Nr) (N) where mi = 2.5 ■ 10~ 3 , ap = 1.2 ■ 10~ 2 and 1](N) is a random 
variable with normal distribution A/"(0, 1). 
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Proposition 5.2. For simple random walks on Z 3 , there exists a v > such 
that 

lim P (degC(u) > e uN ) = 1. (5.6) 



iV— >oo 



Proof. The proof is very close to the one of Proposition 15.11 This time we 
exchange the sites ( and ( + (—1, 1, 0) (if P appears centered in () and we 
prove that V/c > and N large enough P(P/v < arriiN) < p if a < 1. Then 
for all choice of v < mi In 2 ()5.6|) holds. □ 

In dimension n > 4 the same result (with a different value of mi) is 
expected to hold. In fact a similar pattern in n > 4 has E(P/v) = miiV + 
OQnN) in n = 4, E(P N ) = m x N + 0(1) in n > 5 and E(P^ - E(P W )) 2 = 
KpN + o(iV), see [17 . The only point that one should prove for n > 4 is 
that mi 7^ 0. 



5.3 Proof of flHUH) and (GOE) 



The previous results on the degeneracy lead to the following results. 

Proof of (I3.1b|) . Using Propositions 15. II and ()5.1j) we obtain bounds on 5n, 
for n = 2. 

{ „<l- P (de g C>,*^^. (5.7) 

By PropositionEHJ there exists a v > such that P (degC > e uN ^ lnN ^ -> 1 
as A" — ► oo. This gives 

lim (1- 5 N )(\nN) 2 > z//ln4. (5.8) 
Consequently for k 2 < vj In 4 and A" large enough, 5^ < 1 



On the other hand, 



(In AT) 



Sn > t _ g^jg) >1 _gC^>. ,5.9) 
AHn2n ~ iV v 7 

Then 

In AT 

lim (1 - 5^) In JV < lim E(i? 7V ) = vr, (5.10) 

consequently for Ki > n and A" large enough, 5n > 1 — «i/ In A/". □ 
Proof of (fOb"|) . For all z/ > 0, 
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By Proposition 15.21 there exists, for n > 3, a v > such that P(degC > 
e uN ) — > 1 as iV — > oo. Therefore for iV large enough 

and lim sup^v^^ 5^ < 1- □ 



6 Proof of Theorem 13.3 



The contact matrix of u is now defined by (j2.12|) . 

Let us consider the case of SAW. We introduce some notation: we con- 
sider a cube D = {x G Z n s.t. q < x^' < q + 6, 1 < i < n} for 
some c= (ci,...,c n ) G Z n with its boundary dD = {x G Z n s.t. = 
Cj + 6 or a;W = 1 < % < n}. A path P is a SAW of finite length, say 
k, starting at the origin, i.e. P = {Xi(P), < i < k} with X (P) = 0. 

We consider only paths such that there exists a cube D with X (P) = 
and Xk{P) two of its vertices and Xj(P) G D for all < j < k. We say that 
(P, D) occurs at the r th step to if 

1. X r+ j(w) — X r (u) = Xj(P) for all j — 0, . . . , k and 

2. c<j does not occupy any other points of D. 

XnU, (P, D)) is the number of to G ^at such that (P, P) occurs at most at j 
steps. 

Theorem 6.1 (Kesten Pattern Theorem |8J). Let P be a SAW and D 

a cube such that D has and Xk(P) as two of its vertices and contains P. 
Then 

f XN (aN,(P,D)) \ 1/N 
hmsup I 1— ■ — ■ < 1 for some a > (o.lj 



where \Qn\ is the total number of SAW of length N. 
It is known that — f-sAw with Usaw > 0. 

Proof of Theorem 13.31 Let us take b > 2 and consider a SAW path of 
length k + 2n constructed as follows. The firsts n steps of P connect the 
points (0, . . . , 0) and (1, . . . , 1). The following k steps connect the points 
(1, . . . , 1) and (b — 1, . . . , b — 1) with a SAW remaining always in D \ dD. 
The last n steps of P connect the points (b — 1, . . . , b — 1) and (b, . . . , b). Let 
us divide the set into a sum of two disjoint parts: fl^ = U {^nY 
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where Q% = {u G Qn s.t. (P,D) occurs at most aN times} and {Q%) c its 
complementary set. It follows from Theorem 16.11 that 



3C > s.t. F(lu G n%) < e~ C7V . 

Let us take a u G (Q%) c . Then (P, D) occurs at least aN times in uo. Consider 
an occurrence of (P, D) in the piece of P between its t th and its (t + k + 2n) th 
steps. We apply an axis rotation of 2ti/ti degrees to the cube D \ dD, where 
the axis is its diagonal of direction (1, . . . , 1). This transformation does not 
change the contact matrix, and we can apply it n times obtaining each time 
a different SAW. For the chosen u it can be done independently in at least 
aN different places, therefore the corresponding contact matrix is at least 
n aN times degenerate. 

Now we have an upper bound for the total number of contact matrices: 

W(N) < F{uo G n%)\n N \ + F{uo G {Q a N ) c )\Q N \n- aN (6.2) 
< (e~< N + e- Nalnn )\Q N \. 

Defining a« = max{C, a Inn} > and a m = min{(, a Inn} > 0, we obtain 

*mN (, , „-^N 



In l e ~ amN (1 + e~— JV JJ +ln|^| 
limsup7Ar < lim — — - — — — : — (6.3) 



y-.x N ->°° In |Qjv| 

Or, 



hi f^S AW 



< 1. 



□ 



Now we consider the case of bond- self- avoiding walks (BAW). We in- 
troduce some notation: we consider a cube D as for SAW and a cube 
D 1 = {x G Z™ s.t. Ci — 1 < < Ci + b + 1, 1 < i < n} for some 
c = (ci, . . . , c„) G Z n . In this case a path P is a BAW instead of a SAW 
with the same conditions as for SAW. We consider only the paths such that 
there exists a cube D with X (P) = and A^(P) two of its vertices and 
Xj(P) G D 1 for all < j < k. We say that (P, D) occurs at the r th step u if 

1. X r+ j(uj) — X r (u) = Xj(P) for all j — 0, . . . , k and 

2. uo does not occupy any other points of D. 

XnU, (P, D)) is the number of to G VLn such that (P, D) occurs at most at j 
steps. Theorem 16 . 1 1 holds also for BAW [3]. 

Proposition 6.2. For BAW 

lim sup 7jv < 1. (6.4) 

N— >oo 
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Proof. The proof is identical to the one of Theorem 13.31 □ 

Remark 6.3. As noticed by Kesten in ' L 8l, Theorem \6. 1\ could be proven also 
for other lattices in almost the same way, therefore Theorem YS .c\ should hold 
for other lattices than Z n . 
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A Outline of the proof of Proposition 14.2 



Let In = N + 1 — Rn he the number of intersections. Consider an interval 
J = [k ,ki] C [0,1] and the subset A N (J) = {u e Q N s.t. I N (u)/N G J}. 
We define the mean degeneracy on A N (J) by (degC)j = \A N (J)\/W(N)j, 
where W(N)j is the number of contact matrices corresponding to RW with 
I N /N G JWe set d(J) = lim inf N ->oo jj m (degC)j. 

Theorem A.l. For SRW on Z n , n>2, and ix' = lim^^ E( I N )/N, 

d(J = [ko, k\}) > for all ko < tt' and ko < k\ < 1. (A.l) 

For an u G VL N) let us define F(u) to be the number of loops of length 4 
which do not intersect the remaining part of uj (called "free-4- loops" ) . 



Proposition A. 2. Let J be as in Theorem \A.l\ Then there exists an ctj > 
such that 

Pj = liminf--3-lnP{cj G A N (J) s.t. F(u) < ajN} > 0. (A.2) 
Proof of Theorem IA.lt For k < tc' and k± G (ko, 1), 

^^7^ < P{F(^) < ajN for u G A N (J)} (A.3) 
\ a n{J )\ 

+ 2- ajN ¥{F(uj) > ajN for u G A N (J)} < 2 exp(- min{/?j, a,j In 2}N) 

since a contact matrix with M free-4-loops is at least 2 M times degenerate. 
Then it follows by Proposition IA. 21 that d(J) > min{/5j, aj In 2} > 0. □ 
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Outline of the proof of Proposition IA.21 Divide Z™ into disjoint n- 
cubes of edgelength 4. First we remark that at least aN cubes are visited by 
uj E Ajv(J), a = (1 — ki)/4 n , and at least aN/2 are visited at most by 2/a 
steps. Consider A% N (J) = {to E Ajv(J) s.t. F{uj) < aN}, a < I. We do two 
successive operations on uj E Affi(J). 

1) We modify the free-4-loops so that the new RW uj has F(uj) = 0. This 
is obtained by moving the 3rd step to the position of the 1st step of the 
free-4-loops. 

2) We choose 2a N cubes out of the first aN/2 visited less than 2/a steps. 
The choice can be made in (^v) different ways, uj passes in a cube not more 
that 2/a times and at each time we replace the path inside the chosen cubes 
by another one of length increased by 2 which remains on the boundary of 
the cube and leaving the enter and exit points unchanged. Therefore the 
center of the cubes are now empty. Secondly we add a free-4-loop in the 
center of the cubes the first time that are visited by uj. 

The final RW have length n E [N(l + 20a),JV(l + c 2 a)] and I n /n < 
ki + c 2 a, c 2 = 17 + 8/ a. Then using some results of Hamana and Kesten on 
ip(k) = liniTv-^oo — h lnF(R N /N > k) 6J, we conclude that, if (3j = 0, for a 
small enough the number of constructed RW exceeds the total number of RW 
with n E [N(l + 20a), N(l + c 2 a)] and I n /n < h + c 2 a. Therefore (3j > 0. 
Outline of the proof of proposition 14. 2t F(Rn/N < e) is not exponen- 
tially small in (see e.g. and proof of (|3.1ajl ). Let J\ = [0, 1 — e) and 
J 2 = [1 — e, 1]- Since d{J\) > 0, W(N) Jl is exponentially small compared 
with W(N)j 2 for A^ large enough. Therefore for large N, W(N) ~ W(N)j 2 < 
\Q N \¥(R N /N < e), from which follows (JTOjl . 

The complete proof can be found at 
http://www-m5.ma.tum.de/pers/ferrari/homepage/download/appendix.ps.gz. 
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